The use of air-pressure sensor in electrolaryngeal speech enhancement based on statistical voice conversion
نویسندگان
چکیده
In our previous work, we proposed a speaking-aid system converting electrolaryngeal speech (EL speech) to normal speech using a statistical voice conversion technique. The main weakness of our system is the difficulty of estimating natural contours of the fundamental frequency (F0) from EL speech including only built-in F0 contours. This paper proposes another speaking-aid system with an air-pressure sensor to enable laryngectomees to control F0 contours of the EL speech using their breathing air. The experimental result demonstrates that 1) the correlation coefficient of F0 contours between the converted and the target speech is improved from 0.58 to 0.78 by the use of the air-pressure sensor and 2) the synthetic speech converted by the proposed system sounds more natural and is more preferred to that by our conventional aid system.
منابع مشابه
A digital signal processor implementation of silent/electrolaryngeal speech enhancement based on real-time statistical voice conversion
In this paper, we present a digital signal processor (DSP) implementation of real-time statistical voice conversion (VC) for silent speech enhancement and electrolaryngeal speech enhancement. As a silent speech interface, we focus on nonaudible murmur (NAM), which can be used in situations where audible speech is not acceptable. Electrolaryngeal speech is one of the typical types of alaryngeal ...
متن کاملElectrolaryngeal speech enhancement based on statistical voice conversion
This paper proposes a speaking-aid system for laryngectomees using GMM-based voice conversion that converts electrolaryngeal speech (EL speech) to normal speech. Because valid F0 information cannot be obtained from the EL speech, we have so far converted the EL speech to whispering. This paper conducts the EL speech conversion to normal speech using F0 counters estimated from the spectral infor...
متن کاملA Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation
This paper presents an electrolaryngeal (EL) speech enhancement method capable of significantly improving naturalness of EL speech while causing no degradation in its intelligibility. An electrolarynx is an external device that artificially generates excitation sounds to enable laryngectomees to produce EL speech. Although proficient laryngectomees can produce quite intelligible EL speech, it s...
متن کاملA hybrid approach to electrolaryngeal speech enhancement based on spectral subtraction and statistical voice conversion
We present a hybrid approach to improving naturalness of electrolaryngeal (EL) speech while minimizing degradation in intelligibility. An electrolarynx is a device that artificially generates excitation sounds to enable laryngectomees to produce EL speech. Although proficient laryngectomees can produce quite intelligible EL speech, it sounds very unnatural due to the mechanical excitation produ...
متن کاملStatistical Voice Conversion Techniques for Alaryngeal Speech Enhancement
This position paper gives a brief overview of our developed technologies for enhancing alaryngeal speech (AL speech) uttered by laryngectomees. There are several alternative speaking methods for laryngectomees to produce AL speech. However, any type of AL speech suffers from lack of naturalness and speaker individuality (identity). To address this issue, we have developed statistical voice conv...
متن کامل